A Low-Power Text-Dependent Speaker Verification System with Narrow-Band Feature Pre-Selection and Weighted Dynamic Time Warping
Abstract
To fully enable voice interaction in wearable devices, a system requires low-power, customizable voice-authenticated wake-up. Existing speaker-verification (SV) methods have shortcomings relating to power consumption and noise susceptibility. To meet the application requirements, we propose a low-power, text-dependent SV system comprising a sparse spectral feature extraction front-end showing improved noise robustness and accuracy at low power, and a back-end running an improved dynamic time warping (DTW) algorithm that preserves the signal envelope while reducing misalignments. Without background noise, the proposed system achieves an equal error rate (EER) of 1.1%, compared to 1.4% with a conventional Mel-frequency cepstral coefficients (MFCC)+DTW system and 2.6% with a Gaussian mixture model-universal background model (GMM-UBM) based system. At 3 dB signal-to-noise ratio (SNR), the proposed system achieves an EER of 5.7%, compared to 13% with a conventional MFCC+DTW system and 6.8% with a GMM-UBM based system. The proposed system enables simple, low-power implementation such that the power consumption of the end-to-end system, which includes a voice activity detector, feature extraction front-end, and back-end decision unit, is under 380 μW.
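For orientation, the conventional DTW back-end that the abstract uses as a baseline can be sketched as below. This is a minimal textbook formulation, not the paper's improved (weighted) variant, whose details the abstract does not give. The names `dtw_distance` and `verify`, the Euclidean frame distance, the length normalization, and the threshold value are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW cost between two sequences of feature vectors.

    a, b: non-empty lists of equal-dimension feature vectors (lists of floats).
    This is the unweighted baseline, not the paper's weighted DTW.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")

    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean distance between frames
            cost[i][j] = d + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


def verify(enrolled, attempt, threshold):
    """Accept the attempt if the length-normalized DTW cost is within threshold.

    The normalization and the threshold are illustrative assumptions.
    """
    score = dtw_distance(enrolled, attempt) / (len(enrolled) + len(attempt))
    return score <= threshold


if __name__ == "__main__":
    template = [[0.0], [1.0], [2.0]]
    stretched = [[0.0], [0.0], [1.0], [2.0], [2.0]]  # same shape, spoken slower
    print(dtw_distance(template, stretched))
    print(verify(template, stretched, threshold=0.1))
```

In a text-dependent setting, the enrolled template would be the feature sequence of the user's passphrase, and the threshold would be tuned on held-out data, for example at the operating point where false-accept and false-reject rates are equal (the EER).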
Similar resources
Variable print quality
In the literature, much research work has been done in the area of speaker verification. The developments include: different types of speaker verification techniques, methods for feature extraction, measures for telephone channel compensation, system robustness etc. In contrast, the problem of acoustic feature selection for speaker verification has been relatively neglected. Hence our aim is to...
Feature selection for a DTW-based speaker verification system
Speaker verification systems, in general, require 20 to 30 features as input for satisfactory verification. We show that this feature set can be optimised by choosing an appropriate feature subset from the input feature set. This paper proposes a technique for optimisation of the feature sets, in a Dynamic Time Warping (DTW) based text-dependent speaker verification system, to improve fa...
Improvement of speaker verification for Thai language
There are many strategies proposed for speaker verification (SV) systems, both in text-dependent (fixed-text) and text-independent (free-text) domains. To identify an appropriate algorithm for Thai speech, several successive improvement methods are compared in this paper, including the dynamic time warping (DTW) matching and Gaussian mixture model (GMM) based systems. We firstly developed a syste...
A DTW-based DAG technique for speech and speaker feature analysis
A DTW-based directed acyclic graph (DAG) optimization method is proposed to exploit the interaction information of speech and speaker in feature component. We introduce the DAG representation of intra-class samples based on dynamic time warping (DTW) measure and propose two criteria based on in-degree of DAG. Combined with (l − r) optimization algorithm, the DTW-based DAG model is applied to di...
On Feature Selection for Speaker Verification
This paper describes an HMM based speaker verification system, which verifies speakers in their own specific feature space. This ‘individual’ feature space is determined by a Dynamic Programming (DP) feature selection algorithm. A suitable criterion, correlated with Equal Error Rate (EER) was developed and is used for this feature selection algorithm. The algorithm was evaluated on a text-depen...
Publication date: 2016